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(54) Method and apparatus for dynamically deoptimizing compiled activations 



(57) Methods and apparatus for dynamically deop- 
timizing a frame in a control stack during the execution 
of a computer program are disclosed. The described 
methods are particularly suitable for use in computer 
systems that are arranged to execute both interpreted 
and compiled byte codes. According to one aspect of 
the present invention, a computer-implemented method 
for deoptimizing a compiled method includes creating a 
data structure. The data structure, which is separate 
from the control stack, is arranged to store information 
relating to the compiled method. A reference indicator, 
such as a pointer, is created to associate the data struc- 
ture with the frame. The method, which is compiled to a 
first state of optimization, is then deoptimized to a sec- 
ond state of optimization, and the method in the first 
state of optimization may be discarded, thereby deopti- 
mizing the frame. When control returns to the deopti- 
mized frame, a migration routine creates at least one 
new stack frame, and execution may continue using the 
method in the second state of optimization. 
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Description 

BACKGROUND OF THE INVENTION 

1 . Field of Invention 

[0001] The present invention relates generally to 
nnethods and apparatus for deoptimizing connpiled acti- 
vations in software applications. More particularly, the 
present invention relates to nnethods and apparatus for 
performing eager deoptinnization of connpiled activa- 
tions during the overall execution of a connputer pro- 
grann. 

2. Description of Relevant Art 

[0002] Connputer systenns are often linked across a 
network; e.g., local area networks, intranets and inter- 
nets, of connputer systenns such that they nnay share re- 
sources. Share resources often include software appli- 
cations. In general, software applications, or connputer 
progranns, nnay be delivered in different fornnats to dif- 
ferent connputer systenns, due to the fact that each conn- 
puter systenn requires software applications to be in a 
specific fornnat. Alternatively, the connputer progranns 
nnay be delivered to a connputer systenn in a nnachine- 
independent fornn, i.e., as byte codes, in order to enable 
one fornn of a connputer progrann to be utilized by nnany 
different connputer systenns. 

[0003] When connputer progranns are delivered in a 
machine-independent form, the programs may be inter- 
preted directly or the programs may be translated into 
machine-dependent code, i.e., "machine code." Pro- 
grams which are interpreted directly occupy less space 
in a computer system than programs which are translat- 
ed into machine code. However, programs which are in- 
terpreted directly have slower execution speeds than 
programs which are translated into machine code, in 
most cases. As such, the determination of whether or 
not to interpret a computer program directly in lieu of 
translating the computer program into machine code, is 
often based on the relative importance of space in rela- 
tion to execution speed. 

[0004] Some computer systems may be arranged to 
support both interpreted code and compiled, or ma- 
chine, code. During the course of executing a program 
on a system which supports both interpreted and com- 
piled code, it may sometimes be beneficial to eliminate, 
or delete, a compiled method. Eliminating a compiled 
method, which contains compiled activations, generally 
frees space within the computer system. Hence, when 
additional space is needed on a computer system, a 
compiled method may be deleted, and replaced with an 
interpreter code equivalent, because an interpreted 
method occupies less space than its compiled code 
equivalent. 

[0005] In addition, compiled code may have to be dis- 
carded because the code was based on assumptions 



that are no longer valid. By way of example, compiled 
code may be discarded because a new class was load- 
ed, or because program code was changed. When the 
assumptions are no longer valid, i.e., no longer hold, if 
5 the computer system continues to execute the compiled 
code, erroneous results may occur. Hence, the compu- 
ter system must generally cease execution of the com- 
piled code to guard against erroneous results, even in 
the event that the compiled code is currently executing 
70 in one or more activation records. 

[0006] Each method in a computer program, or appli- 
cation, is typically associated with at least one frame on 
a computer control stack. As such, when a compiled 
method is deleted, any frames, i.e., compiled frames, 
75 associated with the compiled method must essentially 
be transformed into interpreter frames. In general, com- 
piled frames may also be transformed into interpreter 
frames when compiled frames are invalidated, as de- 
scribed above. Transforming such invalid compiled 
20 frames into interpreter frames essentially converts 
invalid frames into valid frames, as will be appreciated 
by those skilled in the art. 

[0007] A frame is an activation record that is stored 
on the control stack, as is well known in the art. The 
25 frame pertains to a method, and is arranged to store in- 
formation for the execution of the method. Information 
stored in a frame may include control state variables, 
local variables and an expression stack. A control stack 
is a stack that is utilized during program execution to 
30 store frames for methods, or functions, in their sequen- 
tial calling order. When a method is called, a frame for 
the method is pushed onto the control stack. Subse- 
quently, when the method terminates, the frame asso- 
ciated with the method is popped off the stack and the 
35 function for the new frame on the top of the control stack 
resumes execution, i.e., regains control. 
[0008] A compiled frame may not be transformed into 
an interpreter code equivalent until the compiled frame 
is at the top of the control stack, due to the fact that the 
40 interpreter code equivalent often requires more space 
on the control stack than required by the compiled 
frame. Hence, a method may not be decompiled or 
deoptimized until the frame which corresponds to the 
method is at the top of the control stack, since additional 
45 Space may not be allocated in the middle of the control 
stack. As such, when the compiled frame is located in 
the middle of the control stack, e.g., the compiled frame 
is not the topmost frame in the control stack, the com- 
piled frame may not be replaced by corresponding in- 
50 terpreter frames. 

[0009] Figure 1 is a diagrammatic representation of a 
control stack which includes compiled frames. A control 
stack 104 includes frames 108 which, as described 
above, are associated with methods. By way of exam- 
55 pie, frame 1 08b is a record which contains activation in- 
formation associated with a compiled method 116. A 
stack pointer 112 identifies the location of the topmost 
frame 108 in stack 104, i.e., frame 108c, which is the 



20 



25 



2 



3 



EP 0 908 820 A2 



4 



frame which is currently executing. Arrow 114 indicates 
the direction in which stack 104 grows. Therefore, as 
indicated by arrow 1 1 4., frame 1 08c is the topmost frame 
in stack 104 and, hence, is the frame with control. 
[0010] Compiled method 116, and frame 108b, con- 
tain information which may be used to create an inter- 
preter code equivalent of compiled method 116. The in- 
terpreter code equivalent of compiled method 116 may 
generally be accessed and, hence, obtained at anytime. 
However, interpreter frames associated with the inter- 
preter code equivalent may occupy a different amount 
of space on stack 104 than frame 108b. Since frames 
which contain the interpreter code equivalent of com- 
piled method 116, i.e., interpreter frames, may occupy 
more space on stack 1 04 than frame 1 08b, the interpret- 
er frames may not be inserted into stack 1 04 until frame 
108b is popped to the top of stack 104, as mentioned 
above. In other words, frame 108b must essentially be 
the frame with control in stack 104 before method 116 
and frame 108b may be deoptimized. 
[0011] As described above, frame 108b may not be 
deoptimized when frame 108b is in the middle of stack 
104. Accordingly, when it is determined that compiled 
method 116 is to be deleted, the deletion of compiled 
method 116 and, hence, the deoptimization of frame 
1 08b must be delayed. Therefore, compiled method 1 1 6 
may be marked for deletion, and frame 108b may be 
marked for deoptimization, such that when frame 108b 
is the topmost frame within stack 1 04, compiled method 
116 is deleted, while frame 108b is deoptimized. 
[0012] With reference to Figure 2, the deoptimization 
of frame 108b in stack 104 of Figure 1 will be described. 
When frame 1 08c, as shown in Figure 1 , is popped from 
stack 104, then frame 108b is no longer in a middle po- 
sition within stack 104. In other words, frame 108b be- 
comes the topmost frame in stack 1 04. Frame 1 08b may 
then be replaced by interpreter frame 108b', which in- 
cludes an interpreter activation of compiled method 116. 
Stack pointer 1 1 2 may then be set to point to interpreter 
frame 108b'. Once interpreter frame 108b' is pushed on- 
to stack 104, compiled method 116 may be deleted, as 
indicated by deleted compiled method 116'. Space 
which was previously occupied by compiled method 1 1 6 
is then available to be reallocated for other uses. 
[0013] Frame 108b may also be deoptimized just be- 
fore frame 108b becomes the topmost frame in stack 
104. In other words, frame 108b may be deoptimized 
j ust as frame 1 08c is about to retu rn and be popped from 
stack 104. The deoptimization of frame 108b just before 
frame 1 08b becomes the topmost frame is known as "la- 
zy" deoptimization. In lazy deoptimization, deoptimiza- 
tion of a particular compiled frame is delayed until the 
only frame which is higher in a stack than the particular 
compiled frame is about to return to the particular com- 
piled frame. As such, in lazy deoptimization, the deop- 
timization of a frame occurs just as that frame is about 
to resume execution. 

[0014] In general, at least some of the computer re- 



sources, e.g., memory space, allocated to a compiled 
method may be slated for reallocation when insufficient 
resources are available for other purposes. When re- 
sources allocated to the compiled method are to be re- 

5 allocated or redistributed, the compiled method must be 
deleted, or otherwise deoptimized, in order to free the 
resources. Therefore, deferring the deletion of the com- 
piled method is often undesirable, since resources 
which may be used immediately for other purposes will 

70 not available until the compiled method is deleted. That 
is, since the length of delays prior to deletion are often 
relatively long, depending on factors such as the stack 
location of a frame that is to be deoptimized, the process 
of decompiling a compiled method and deoptimizing an 

75 associated compiled frame may be inefficient. As such, 
mechanisms that essentially eliminate the delays asso- 
ciated with decompilation and deoptimization are de- 
sired. That is, methods and apparatus which improve 
the efficiency of dynamic decompilation of compiled 

20 methods and dynamic deoptimization of compiled 
frames associated with a computer program would be 
desirable. 

SUMMARY OF THE INVENTION 

25 

[001 5] Methods and apparatus for dynamically deop- 
timizing a frame in a control stack during the execution 
of a computer program are disclosed. The described 
methods are particularly suitable for use in computer 

30 systems that are arranged to execute both interpreted 
and compiled byte codes. According to one aspect of 
the present invention, a computer-implemented method 
for deoptimizing a compiled method includes creating a 
data structure. The data structure, which is separate 

35 from the control stack, is arranged to store information 
relating to the compiled method and the frame on the 
control stack that is to be deoptimized. A reference in- 
dicator, such as a pointer, is created to associate the 
data structure with the frame. The compiled method may 

40 then be discarded immediately. In one embodiment, the 
data structure is machine-independent, and the decom- 
pilation of the compiled method includes deleting the 
compiled method. 

[0016] According to another aspect of the present in- 
45 vention, a method for dynamically deoptimizing a first 
frame, which is associated with a compiled method 
which includes a compiled activation, on a control stack 
includes decompiling the compiled method. The com- 
piled method may be decompiled when the first frame 
50 is located beneath a second frame on the control stack. 
Decompiling the compiled method includes decompiling 
the compiled activation to create an interpreter equiva- 
lent of the compiled activation. The second frame is 
eventually popped from the control stack, and an inter- 
ns prefer frame is created from the interpreter equivalent 
of the compiled activation. The interpreter frame is then 
pushed onto the stack in place of the first frame. In one 
embodiment, the interpreter equivalent of the compiled 
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activation is stored in a temporary data structure tinat is 
associated witin tine first frame. In such an embodiment, 
the data structure may be deleted after the interpreter 
frame is created. 

[0017] According to still another aspect of the present 
invention, a method for dynamically deoptimizing a com- 
piled method associated with a first frame selected from 
a plurality of frames on a control stack includes creating 
a data structure that is referenced by the first frame. The 
data structure, which is separate from the control stack, 
contains interpreter information relating tothe first frame 
and method information relating to the compiled meth- 
od. The compiled method is deleted, and the interpreter 
information is unpacked when the stack pointer associ- 
ated with the control stack identifies the first frame as 
the current frame. Unpacking the interpreter information 
involves the creation of an interpreter frame. The inter- 
preter frame is pushed onto the control stack to replace 
the first frame. 

[001 8] These and other advantages of the present in- 
vention will become apparent upon reading the following 
detailed descriptions and studying the various figures of 
the drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0019] The invention may best be understood by ref- 
erence to the following description taken in conjunction 
with the accompanying drawings in which: 
[0020] Figure 1 is a diagrammatic representation of a 
control stack which includes compiled frames. 
[0021] Figure 2 is a diagrammatic representation of 
control stack 104 of Figure 1 after an interpreter frame 
has replaced a compiled frame. 
[0022] Figure 3a is a diagrammatic representation of 
a control stack which includes compiled frames in ac- 
cordance with an embodiment of the present invention. 
[0023] Figure 3b is a diagrammatic representation of 
control stack 304 of Figure 3a after a compiled frame 
has been deoptimized and a virtual frame array has 
been created in accordance with an embodiment of the 
present invention. 

[0024] Figure 3c is a diagrammatic representation of 
control stack 304 of Figure 3b after deoptimized frame 
308b' becomes the topmost frame in accordance with 
an embodiment of the present invention. 
[0025] Figure 3d is a diagrammatic representation of 
control stack 304 of Figure 3c after interpreter frames 
replace the deoptimized frame in accordance with an 
embodiment of the present invention. 
[0026] Figure 4 is a process flow diagram which illus- 
trates the steps associated with eagerly deoptimizing a 
compiled frame in accordance with an embodiment of 
the present invention. 

[0027] Figure 5 is a process flow diagram which illus- 
trates the steps associated with unpacking a virtual 
frame array in accordance with an embodiment of the 
present invention. 



[0028] Figure 6 is a diagrammatic representation of a 
control stack which includes interpreterf rames and a mi- 
grating frame in accordance with an embodiment of the 
present invention. 
5 [0029] Figure 7 illustrates a typical, general purpose 
computer system suitable for implementing the present 
invention. 

[0030] Figure 8 is a diagrammatic representation of a 
virtual machine which is suitable for implementing the 
70 present invention. 

DETAILED DESCRIPTION OF THE EMBODIMENTS 

[0031] During the execution of a computer program 

75 which is running in a computer system that supports 
both interpreted code and compiled code, the deletion 
of compiled methods is typically desirable when re- 
sources within the computer system are to be more ef- 
ficiently allocated, as described above. Generally, the 

20 deoptimization of a compiled method and, further, the 
deoptimization of an associated compiled frame on a 
control stack, are delayed until the associated compiled 
frame is either the topmost frame on the stack or about 
to become the topmost frame on the stack. Since the 

25 deoptimization of a compiled frame may involve the gen- 
eration of interpreter frames which occupy more space 
on the stackthan the compiled frame occupied, the com- 
piled frame is not deoptimized while it is located some- 
where in the middle of the stack due to space con- 

30 straints. The delay in the deletion of the compiled meth- 
od delays the availability of resources within the com- 
puter system. 

[0032] Allowing the deoptimization of a compiled 
method and an associated compiled frame to occur re- 

35 gardless of the stack location of the compiled frame al- 
lows the compiled method to be discarded essentially 
without delay. In order to allow the deoptimization of a 
compiled method and a corresponding compiled frame 
to occur without being deferred until control is either re- 

40 turned to or about to be returned to the compiled frame, 
a structure may be created to temporarily hold interpret- 
er information associated with the compiled method. 
Such a deoptimization may be considered to be an "ea- 
ger" deoptimization, as it does not require that deopti- 

45 mization be deferred until a frame which is to be deop- 
timized either has, or is about to gain, control. 
[0033] By creating a data structure to hold interpreter 
information off of the stack, a compiled frame, which is 
to be replaced by frames created from the interpreter 

50 information, and a corresponding method may be deop- 
timized while the compiled frame is in the middle of a 
stack. In other words, the compiled method associated 
with the compiled frame may be deleted, or otherwise 
deoptimized, without deferring the deletion until the 

55 compiled frame reaches the top of the stack. Since in- 
terpreter frames created from the interpreter information 
may occupy more space on the stack than the deopti- 
mized frame originally occupied, the use of a data struc- 
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ture enables deoptimization to occur eagerly without en- 
countering issues involving the sizing of interpreter 
frames in the middle of the stack. That is, the data struc- 
ture temporarily holds information which is intended to 
be unpacked to create interpreter frames when the 
deoptimized frame eventually reaches the top of the 
stack. 

[0034] Figure 3a is a diagrammatic representation of 
a control stack which includes a frame that is associated 
with a compiled method in accordance with an embod- 
iment of the present invention. A stack 304 includes a 
plurality of optimized, or compiled^ frames 308. A stack 
pointer 312 identifies a current frame, e.g., frame "f4" 
308d, in stack 304. Current frame 308d is the frame 
which has control within an overall computer system that 
includes stack 304. That is. current frame 308d is the 
topmost frame within stack 304. In the described em- 
bodiment, a frame "f2" 308b, referred to herein and be- 
low as frame 308b, is associated with a compiled meth- 
od 316. Specifically, frame 308b includes state informa- 
tion pertaining to compiled activations included in com- 
piled method 31 6. 

[0035] In general, compiled method 316, which is 
compiled to a given level of optimization; may be deop- 
timized such that compiled method 31 6 is at a lower lev- 
el of optimization. In one embodiment, when compiled 
method 316 is to be deleted, then the compiled activa- 
tions in compiled method 316 may be replaced by byte 
code equivalents, e.g., interpreter activations. That is, 
frame 308b, which is optimized, may be replaced by an 
interpreted frame, or frames. 

[0036] Compiled method 31 6 may be deoptimized for 
a variety of different reasons. The different reasons in- 
clude, but are not limited to, the invalidation of compiled 
method 316 due to changes in program code, and the 
need for additional computer memory space. By deop- 
timizing compiled method 316 or, more specifically, 
frame 308b, frame 308b may be made valid when com- 
piled method 316 is invalid. Further, as compiled code 
generally occupies more computer memory space than 
interpreted code, deoptimizing compiled method 316 
typically releases computer memory space. 
[0037] I nf ormation relating to an interpreted represen- 
tation of compiled method 316 may be contained within 
both compiled method 316 and frame 308b. In general, 
compiled method 316 contains static information while 
frame 308b contains dynamic information, as will be ap- 
preciated by those skilled in the art. The information is 
used, as will be described below with reference to Figure 
4, in the creation of a data structure that is essentially 
arranged to hold interpreter activations. The creation of 
the data structure that holds interpreter activations al- 
lows frame 308b to be deoptimized. Figure 3b is a dia- 
grammatic representation of stack 304 after a data 
structure that holds interpreter activations is created in 
accordance with an embodiment of the present inven- 
tion. A data structure 324, which is called a virtual frame 
("vframe") array, holds interpreter activations 328 that 



represent compiled method 316 of Figure 3a. 
[0038] As shown, compiled method 316 is represent- 
ed by three interpreter activations 328. It should be ap- 
preciated, however, that the number of interpreter acti- 

5 vations 328 that represent compiled method 316 may 
generally vary widely depending upon a variety of dif- 
ferent factors. Such factors may include, but are not lim- 
ited to, the size of compiled method 316, i.e., the number 
of compiled activations associated with compiled meth- 

70 od 316, and optimizations, such as inlining, performed 
by the compiler. 

[0039] A pointer 332 from deoptimized frame 308b' 
identifies vframe array 324. In other words, pointer 332 
is arranged to indicate that vframe array 324 is associ- 

75 ated with deoptimized frame 308b'. In general, vframe 
array 324 is associated with only a single deoptimized 
frame, such as deoptimized frame 308b'. However, in 
some embodiments, a plurality of deoptimized frames, 
both adjacent and non-adjacent, may share a single 

20 vframe array. 

[0040] The creation of vframe array 324 may gener- 
ally occur at any time. In particular, vframe array 324 
may be created while deoptimized frame 308b' is in the 
middle of stack 304. Once vframe array 324 is created, 

25 compiled method 31 6 is deleted, as indicated by deleted 
compiled method 31 6'. The deletion of compiled method 
316 allows system resources, e.g., memory space, as- 
sociated with compiled method 316 to be reallocated. 
Specifically, compiled method 316 may be deleted be- 

30 fore deoptimized frame 308b' is at the top of stack 304, 
i.e., substantially at anytime before deoptimized frame 
308b' becomes the current frame. Therefore, system re- 
sources associated with compiled method 316 may be 
reallocated, and frame 308b of Figure 3a may become 

35 deoptimized frame 308b', whenever necessary. 

[0041] When stack pointer 312 indicates that deopti- 
mized frame 308b' is the current frame on stack 304, as 
shown in Figure 3c, then vframe array 324 is unpacked 
to create corresponding interpreter frames on stack 304, 

40 as shown in Figure 3d. Interpreter activation 328a is un- 
packed and essentially transformed into interpreter 
frame 328a', which is placed on stack 304. Similarly, in- 
terpreter activation 328b corresponds to interpreter 
frame 328b', and interpreter activation 328c corre- 

45 sponds to interpreter frame 328c'. Once interpreter 
frames 328' are pushed onto stack 304, stack pointer 
312 is set to identify interpreter frame 328c' as the cur- 
rent frame. In one embodiment, a system may addition- 
ally create a migrating frame 330 that is used to properly 

50 link all interpreterf rames 328. The steps associated with 
unpacking vframe array 324 and, hence, creating inter- 
preter frames 328' on stack 304 will be described below 
with reference to Figure 5. 

[0042] As previously mentioned, eagerly deoptimiz- 
55 ing an optimized frame through the use of a vframe array 
allows the deletion of a compiled method associated 
with the optimized frame to occur at substantially any 
time during the execution of a program. By not delaying 
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deoptimization of a frame until the frame has control, 
computer resources may be freed through deoptimiza- 
tion more efficiently. For example, when space is need- 
ed within a computer system, a compiled method may 
be slated for deletion. Allowing the associated frame to 
be deoptimized even while the frame is in the middle of 
a stack enables the compiled method to be substantially 
immediately deleted, thereby quickly freeing space with- 
in the computer system when space is needed. 
[0043] Referring next to Figure 4, the steps associat- 
ed with eagerly deoptimizing an optimized frame will be 
described in accordance with an embodiment of the 
present invention. In general, aframe which is optimized 
to a given level may be deoptimized to a less optimized 
level for any number of different reasons. While the rea- 
sons may be widely varied, such reasons may include, 
but are not limited to, the need to validate an invalid 
frame, the need to save space in computer memory, and 
the need to obtain a machine-independent control stack. 

[0044] The process of deoptimizing an optimized 
frame begins at step 402 in which the program counter 
associated with the optimized frame, or the frame which 
is to be deoptimized, is obtained. The program counter 
is obtained for use in accessing access information that 
is present in the compiled method which is associated 
with the optimized frame. Access information generally 
includes, but is not limited to, values or locations of 
source-level variables associated with the compiled 
method, as will be appreciated by those skilled in the art. 
[0045] The access information is used in step 404 to 
extract an interpreter state from the optimized frame for 
use in obtaining an interpreter activation. It should be 
appreciated that there may be multiple interpreter acti- 
vations which are associated with the optimized frame. 
Once interpreter activations associated with the opti- 
mized frame are obtained, the interpreter activations are 
migrated; e.g., placed, into a vframe array in step 406. 
As a result, substantially all information contained within 
a vframe array, both static and dynamic, may be used 
to eventually push a series of interpreter frames onto 
the stack, as will be discussed below with reference to 
Figure 5. 

[0046] After interpreter activations are stored into the 
vframe array a pointer to the vframe array is stored with- 
in the corresponding, e.g., optimized, frame in step 408. 
In general, the vframe array pointer may be stored in 
any fixed location within the optimized frame. By way of 
example, the vframe array pointer may be stored in the 
first local variable of the optimized frame, thereby ena- 
bling the vframe array pointer to be readily accessed. 
From step 408, process flow moves to step 410 where 
the return address of the frame that is being called by 
the frame which is being deoptimized is set to the ad- 
dress of the unpacking routine for the vframe array. The 
unpacking routine, which generally includes a migrating 
routine, will be described in more detail below with ref- 
erence to Figure 6. 



[0047] The compiled method from which access infor- 
mation was obtained may be deleted in step 412 after 
the return address of the frame that is being called, e. 
g., the "callee," is changed. In general, the return ad- 

5 dress of the callee, which may originally be set to the 
address of the compiled method, is changed to prevent 
the callee from returning to a non-existent; deleted com- 
piled method. As will be appreciated by those skilled in 
the art, in one embodiment, before the compiled method 

70 is allowed to be deleted, a determination may be made 
to ascertain whether other frames reference the com- 
piled method. If other frames reference the compiled 
method, then the other frames are typically also deop- 
timized before the compiled method is deleted. It should 

75 be appreciated that although the process of deoptimiz- 
ing a frame is shown to end after the return address is 
changed in step 410, the associated compiled method 
is deleted before the deoptimization process is truly 
completed. As mentioned above, the deletion of the 

20 compiled method does not generally occur until all 
frames associated with the compiled method have been 
processed, e.g., using steps 402 to 410. 
[0048] A deoptimized frame, as described above with 
respect to Figure 3c, is not physically replaced by the 

25 interpreter frames until the deoptimized frame becomes 
the current frame. In other words, interpreter frames 
which correspond to the deoptimized frame are not 
pushed onto the stack in place of the deoptimized frame 
until the stackpointer references the deoptimized frame. 

30 Replacing a deoptimized frame with an interpreter frame 
generally involves unpacking the vframe array refer- 
enced by the interpreter frame to obtain the interpreter 
frames. Figure 5 is a process flow diagram which illus- 
trates the steps associated with one method of unpack- 

35 ing a vframe array and placing interpreter frames un- 
packed from the vframe array onto a stack in accord- 
ance with an embodiment of the present invention. The 
process of replacing a deoptimized frame on a stack 
with corresponding interpreter frames begins at step 

40 502 in which the vframe array associated with the deop- 
timized frame is obtained, and the size of corresponding 
interpreter frames are computed. That is, using the 
vframe array of the deoptimized frame, the size of the 
interpreter frames which will replace the deoptimized 

45 frame is calculated. Calculating the size of interpreter 
frames enables space on the stack to be allocated to 
accommodate the interpreter frames. Each interpreter 
frame will generally have a size which is dependent up- 
on the size of its corresponding interpreter activation. 

50 [0049] In ordertoallocate space for interpreter frames 
on the stack, the stack pointer for the stack is modified 
in step 504. Modifying the stack pointer may include set- 
ting the stack pointer to point to a location further up the 
stack from the deoptimized frame. Typically, the stack 

55 pointer is set to identify a location on the stack which 
will eventually correspond to the location of a highest 
interpreter frame, associated with the deoptimized 
frame, within the stack. 
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[0050] After the stack pointer is changed in step 504, 
a stack frame for a migrating routine is created, and the 
migrating routine is invoked, in step 506. The migrating 
routine, as will be appreciated by those skilled in the art, 
is arranged to place interpreter activations onto the 
stack. Placing interpreter activations onto a stack typi- 
cally involves transforming the information from the 
vframe array into frames. In one embodiment, trans- 
forming interpreter activations into interpreter frames in- 
cludes creating a return address field and a frame point- 
er field that are associated with each interpreter activa- 
tion. Invoking the migrating routine may also change the 
stack pointer further, as for example to identify the stack 
frame associated with the migrating routine. 
[0051] Within the migrating routine, interpreter activa- 
tions are migrated to create interpreter frames on the 
stack. Specifically, each interpreter activation "i" is mi- 
grated to create an interpreter frame on the stack. The 
number of interpreter frames "N" that are to be created 
is dependent upon the number of interpreter activations 
included in the vframe array. 

[0052] When the migrating routine is invoked, process 
flow moves from step 506 to steps 508 and 510, which 
are a representation of a loop in which all interpreter ac- 
tivations in the vframe array are individually migrated to 
create a corresponding interpreter frame on a stack. In 
other words, interpreter frames corresponding to inter- 
preter activations are pushed onto the stack. The loop 
continues until all interpreter activations have been mi- 
grated, at which point the process of unpacking a vframe 
array and placing interpreter frames onto a stack is com- 
pleted. 

[0053] In general, creating interpreter frames on a 
stack involves setting various addresses and pointers 
associated with the interpreter frames. The addresses 
and pointers which are set, or updated, may vary widely. 
By way of example, a frame pointer associated with an 
interpreter frame may be reset to reference another in- 
terpreter frame. A return address of an interpreter frame 
may also be set to properly reference the caller of the 
interpreter frame. Figure 6 is a diagrammatic represen- 
tation of a control stack which includes interpreter 
frames in accordance with an embodiment of the 
present invention. In other words, Figure 6 is a diagram- 
matic representation of a control stack, after an eager 
deoptimization process has been completed, and inter- 
preter frames have been pushed onto the stack. Control 
stack 602 includes a compiled frame 606 which called 
another compiled frame (not shown) that was replaced 
by interpreted frames 610 after deoptimization. 
[0054] A frame, e.g., compiled frame 606, generally 
includes a "data portion" 608 which contains program 
data, including local variables and parameters. Com- 
piled frame 606 also includes a frame pointer 614 and 
a return address 618. Frame pointer 614 identifies the 
links associated with compiled frame 606. In other 
words, frame pointer 614 identifies a frame, e.g., inter- 
preter frame 61 Oa, that is referenced by compiled frame 



606. Return address 618 specifies the address location 
of the frame on stack 602 to which compiled frame 606 
returns. Return address 61 8 specifies the address of the 
frame which called compiled frame 606, i.e., return ad- 

5 dress 618 specifies the current program counter of the 
caller of compiled frame 606. As will be appreciated by 
those skilled in the art, frame pointer 61 4 and return ad- 
dress 618 represent control information that is at least 
partially dependent upon the configuration of an asso- 

70 ciated interpreter and an associated compiler. Frames 
within stack 602 may also generally include various oth- 
er pointers. 

[0055] When interpreter frames 610 are pushed onto 
stack 602, frame pointers 622 and return addresses 626 

75 of interpreter frames 610 may be set. That is, links as- 
sociated with interpreter frames 610 may be estab- 
lished. As previously mentioned, frame pointers 622 
may be set to identify frames called by interpreter 
frames 610. By way of example, frame pointer 622a of 

20 interpreter frame 61 Oa may be set to identify interpreter 
frame 610b. Information relating to frame pointers 622 
and return addresses 626 is generally dependent upon 
the particular location of each interpreter frame on stack 
602, and may be readily computed based on the func- 

25 tionality of the interpreter. Hence, such information is 
typically not included in a corresponding vframe array 
Information contained in the vframe array is unpacked 
into data portions 624 of interpreter frames 610. 
[0056] Topmost interpreter frame 6 10c is set up as the 

30 caller of a migrating frame 630. Migrating frame 630, 
which includes a frame pointer 634 and a return address 
638, is associated with a migrating routine that is ar- 
ranged to set up interpreter frames 610 on stack 602, 
as described above. In other words, migrating frame 630 

35 is essentially arranged to allocate frames 61 0 on stack 
602 such that when the migrating routine returns, the 
control will be within interpreter frames 610. The migrat- 
ing routine associated with migrating frame 630 also 
sets frame pointers 622 and return addresses 626 to ref- 

40 erence their appropriate callees and callers using meth- 
ods which are well known to those skilled in the art. Re- 
turn address 638 is set to the program counter of the 
caller of migrating frame 630, while frame pointer 634 is 
set to identify topmost interpreter frame 610c. 

45 [0057] As shown, migrating frame 630 is identified by 
a stack pointer 61 2 and is, hence, the frame with control 
in control stack 602. A frame pointer "fp" 613 identifies 
the location of control information within the frame iden- 
tified by stack pointer 612, e.g., frame pointer 634 in mi- 

so grating frame 630. 

[0058] While vframe arrays are essentially machine 
independent, as previously mentioned, interpreter 
frames 610 are not necessarily machine independent. 
Therefore, the migrating routine associated with migrat- 
es ing frame 630 is essentially used to add machine de- 
pendent information to information obtained from a 
vframe array. Using the migrating routine, deoptimiza- 
tion may be performed without needing a second control 
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stack or thread for the deoptimization process. As will 
be appreciated by those skilled in the art, the migrating 
routine nnay also execute on a different stack, /.a, a sec- 
ond control stack. 

[0059] The present invention nnay generally be imple- 
nnented on any suitable computer system. Specifically, 
deoptimization as described above may be accom- 
plished using any suitable virtual machine, such as the 
virtual machine described below with respect to Figure 
8. Figure 7 illustrates a typical, general purpose compu- 
ter system suitable for implementing the present inven- 
tion. The computer system 730 includes any number of 
processors 732 (also referred to as central processing 
units, or CPUs) that are coupled to memory devices in- 
cluding primary storage devices 734 (typically a read on- 
ly memory, or ROM) and primary storage devices 736 
(typically a random access memory, or RAM). 
[0060] Computer system 730 or, more specifically, 
CPU 732, may be arranged to support a virtual machine, 
as will be appreciated by those skilled in the art. One 
example of a virtual machine that is supported on com- 
puter system 730 will be described below with reference 
to Figure 8. As is well known in the art, ROM acts to 
transfer data and instructions uni-directionally to the 
CPU 732, while RAM is used typically to transfer data 
and instructions in a bi-directional manner. CPU 732 
may generally include any number of processors. Both 
primary storage devices 734, 736 may include any suit- 
able computer-readable media. A secondary storage 
medium 738, which is typically a mass memory device, 
is also coupled bi-directionally to CPU 732 and provides 
additional data storage capacity. The mass memory de- 
vice 738 is a computer-readable medium that may be 
used to store programs including computer code, data, 
and the like. Typically mass memory device 738 is a 
storage medium such as a hard disk or a tape which is 
generally slower than primary storage devices 734, 736. 
Mass memory storage device 738 may take the form of 
a magnetic or paper tape reader or some other well- 
known device. It will be appreciated that the information 
retained within the mass memory device 738, may, in 
appropriate cases, be incorporated in standard fashion 
as part of RAM 736 as virtual memory. A specific primary 
storage device 734 such as a CD-ROM may also pass 
data uni-directionally to the CPU 732. 
[0061] CPU 732 is also coupled to one or more input/ 
output devices 740 that may include, but are not limited 
to, devices such as video monitors, track balls, mice, 
keyboards, microphones, touch-sensitive displays, 
transducer card readers, magnetic or paper tape read- 
ers, tablets, styluses, voice or handwriting recognizers, 
or other well-known input devices such as, of course, 
other computers. Finally, CPU 732 optionally may be 
coupled to a computer or telecommunications network, 
e.g., a local area network, an internet network or an in- 
tranet network, using a network connection as shown 
generally at 712. With such a network connection, it is 
contemplated that the CPU 732 might receive informa- 



tion from the network, or might output information to the 
network in the course of performing the above-de- 
scribed method steps. Such information, which is often 
represented as a sequence of instructions to be execut- 

5 ed using CPU 732, may be received from and outputted 
to the network, for example, in the form of a computer 
data signal embodied in a carrier wave. The above-de- 
scribed devices and materials will be familiar to those of 
skill in the computer hardware and software arts. 

10 [0062] As previously mentioned, a virtual machine 
may execute on computer system 730. Figure 8 is a di- 
agrammatic representation of a virtual machine which 
is supported by computer system 730 of Figure 7, and 
is suitable for implementing the present invention. When 

^5 a computer program, e.g., a computer program written 
in the Java™ progranming language developed by Sun 
Microsystems of Palo Alto, California, is executed, 
source code 810 is provided to a compiler 820 within a 
compile-time environment 805. Compiler 820 translates 

20 source code 810 into byte codes 830. In general, source 
code 810 is translated into byte codes 830 at the time 
source code 810 is created by a software developer. 
[0063] Byte codes 830 may generally be reproduced, 
downloaded, or otherwise distributed through a net- 

25 work, e.g., network 71 2 of Figure 7, or stored on a stor- 
age device such as primary storage 734 of Figure 7. In 
the described embodiment, byte codes 630 are platform 
independent. That is, byte codes 830 may be executed 
on substantially any computer system that is running a 

30 suitable virtual machine 840. By way of example, in a 
Java™ environment, byte codes 830 may be executed 
on a computer system that is running a Java™ virtual 
machine. 

[0064] Byte codes 830 are provided to a runtime en- 
35 vironment 835 which includes virtual machine 840. 
Runtime environment 835 may generally be executed 
using a processor such as CPU 732 of Figure 7. Virtual 
machine 840 includes a compiler 842, an interpreter 
844, and a runtime system 846. Byte codes 830 may 
40 generally be provided either to compiler 842 or interpret- 
er 844. 

[0065] When byte codes 830 are provided to compiler 
842, methods contained in byte codes 830 are compiled 
into machine instructions, as described above. On the 

45 Other hand, when byte codes 830 are provided to inter- 
preter 844, byte codes 830 are read into interpreter 844 
one byte code at a time. Interpreter 844 then performs 
the operation defined by each byte code as each byte 
code is read into interpreter 844. In general, interpreter 

50 844 processes byte codes 830 and performs operations 
associated with byte codes 830 substantially continu- 
ously. 

[0066] When a method is called from an operating 
system 860, if it is determined that the method is to be 
55 invoked as an interpreted method, runtime system 846 
may obtain the method from interpreter 844. If, on the 
other hand, it is determined that the method is to be in- 
voked as a compiled method, runtime system 846 acti- 



8 



15 



EP 0 908 820 A2 



16 



vates compiler 842. Compiler 842 then generates ma- 
chirne instructiorns from byte codes 830, and executes 
the machine-language instructions. In general, the ma- 
chine-language instructions are discarded when virtual 
machine 840 terminates. 

[0067] Although only a few embodiments of the 
present invention have been described, it should be un- 
derstood that the present invention may be embodied 
in many other specific forms without departing from the 
spirit or the scope of the invention. By way of example, 
steps involved with deoptimizing a given frame may be 
reordered, removed, or added. In general, steps in- 
volved with the methods of the present invention may 
be reordered, removed, or added without departing from 
the spirit or the scope of the present invention. 
[0068] Substantially all frames in a stack may be 
deoptimized, i.e., vframe arrays may be created for sub- 
stantially all of the frames in a stack. Alternatively, a sin- 
gle vframe array may be created to represent a plurality 
of frames in a stack. Since vframe arrays are machine 
independent, deoptimizing an entire stack and, hence, 
creating vframe arrays for the entire stack, enables the 
entire stack to be exported to any suitable computer sys- 
tem. In other words, machine independent vframe ar- 
rays may be shared between different computer sys- 
tems. Machine independent vframe arrays may be 
stored on a medium such as a disk for a later restart of 
execution. Alternatively machine independent frames 
may also be transmitted to another computer, as for ex- 
ample a computer with a different architecture, where 
execution may continue after the vframe arrays are un- 
packed into an interpreted control stack, as described 
above. 

[0069] In one embodiment, extracting interpreter 
states from an optimized frame, then placing the inter- 
preter states into a vframe array may involve the use of 
both static and dynamic interpretations. That is, scope 
descriptors may be obtained using the program counter 
associated with the optimized frame; and dynamic infor- 
mation may be obtained substantially directly from the 
optimized frame. The scope descriptors, which include 
parameter variables, and the dynamic information, 
which includes actual values for the parameter variables 
of the scope descriptors, may then be used to generate 
vframes. The vframes may then be serialized into a 
vframe array. 

[0070] While the use of an individual vframe array for 
each frame which is to be deoptimized has been de- 
scribed, it should be appreciated that a plurality of 
frames that are to be deoptimized may also share a 
vframe array. By way of example, when frames which 
are to be deoptimized are adjacently located on a stack, 
the interpreter activations associated with each of the 
adjacent frames may be migrated to a shared vframe 
array without departing from the spirit or the scope of 
the present invention. 

[0071] Although the deoptimization of optimized 
frames has been described in terms of deoptimizing a 



compiled frame into an interpreted representation of the 
compiled frame, in one embodiment, the deoptimization 
of optimized frames may instead be applied to the deop- 
timization of frames compiled with a certain level of op- 

5 timization into frames compiled with a lower level of op- 
timization. In other words, a method compiled with a 
high level of optimization may be "uncompiled" to reflect 
a lower level of optimization. Reducing the level of op- 
timization for an originally compiled method typically re- 

70 leases memory space associated with the compiled 
method. When the level of optimization associated with 
a compilation is reduced, a vframe array may include 
less-optimized compiled activations of an originally 
compiled method. Therefore, the present examples are 

75 to be considered as illustrative and not restrictive, and 
the invention is not to be limited to the details given here- 
in, but may be modified within the scope of the append- 
ed claims along with their full scope of equivalents. 

20 

Claims 

1. A computer-implemented process for deoptimizing 
a compiled method, the method being in a first state 

25 of optimization, the method further being associat- 
ed with a frame located on a control stack, the com- 
puter-implemented process comprising: 

creating a data structure, the data structure be- 
30 ing separate from the control stack, wherein the 

data structure includes information related to 
the method, the information being obtained 
from at least one of the method and the frame; 
creating a reference indicator, the reference in- 
35 dicator being associated with the frame, where- 

in the reference indicator is arranged to refer- 
ence the data structure; and 
deoptimizing the method to a second state of 
optimization after the data structure is created. 

40 

2. A computer-implemented process as recited in 
claim 1 wherein the data structure is machine inde- 
pendent, and deoptimizing the method includes de- 
leting the method in the first state of optimization. 

45 

3. A computer-implemented process as recited in any 
one of the preceding claims wherein the second 
state of optimization is an interpreted state. 

50 4. A computer-implemented process as recited in any 
one of the preceding claims wherein the frame is 
located in a middle section of the control stack. 

5. A computer-implemented process as recited in any 
55 one of the preceding claims wherein decompiling 
the method includes deoptimizing an optimized ac- 
tivation included in the method. 
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6. A computer-implemented process as recited in 
claim 5 further including storing the deoptimized ac- 
tivation in the data structure. 

7. A computer-implemented process as recited in 
claim 6 further including: 

creating a new frame using the deoptimized ac- 
tivation stored in the data structure: and 
pushing the new frame onto the control stack 
in place of the frame. 

8. A computer-implemented process as recited in any 
one of the preceding claims wherein the data struc- 
ture is machine independent, the computer-imple- 
mented process further including transferring the 
data structure from a first computer system in the 
direction of a second computer system. 

9. A computer-implemented process as recited in 
claim 8 wherein the data structure is created on the 
first computer system, the computer-implemented 
process further including creating a new frame on 
the second computer system using the deoptimized 
activation stored in the data structure. 

10. A computer-implemented method for dynamically 
deoptimizing a first frame on a control stack, the first 
frame having an associated compiled method, the 
compiled method including a compiled activation, 
the computer-implemented method comprising: 

deoptimizing the compiled method associated 
with the first frame, the first frame being located 
beneath a second frame on the control stack, 
wherein deoptimizing the compiled method in- 
cludes deoptimizing the compiled activation, 
deoptimizing the compiled method further in- 
cluding creating an interpreter equivalent of the 
compiled activation; 

removing the second frame from the control 
stack; 

creating at least one interpreter frame from the 
interpreter equivalent of the compiled activa- 
tion; and 

pushing the at least one interpreter frame onto 
the stack in place of the first frame. 

11. A computer-implemented method as recited in 
claim 10 further including storing the interpreter 
equivalent of the compiled activation in a data struc- 
ture, wherein the data structure is associated with 
the first frame. 

12. A computer-implemented method as recited in 
claim 11 further including deleting the data structure 
after the at least one interpreter frame is created. 



18 

13. A computer-implemented method for dynamically 
deoptimizing a compiled method, the compiled 
method being associated with a first frame selected 
from a plurality of frames on a control stack, the con- 

5 trol stack including a stack pointer arranged to iden- 
tify a current frame on the control stack, the com- 
puter-implemented method comprising: 

creating a data structure, the data structure be- 
70 ing separate from the control stack, the data 

structure including interpreter information relat- 
ing to the first frame and method information 
relating to the compiled method, wherein the 
first frame includes a reference to the data 
75 structure; 

deleting the compiled method: 
unpacking the interpreter information included 
in the data structure when the stack pointer 
identifies the first frame as the current frame on 
the control stack, wherein unpacking the inter- 
preter information creates at least one inter- 
preter frame; 

pushing the at least one interpreter frame onto 
the control stack, wherein the first frame is re- 
placed by the at least one interpreter frame. 

14. A computer-implemented method as recited in 
claim 13 further including: 

accessing access information associated with 
the compiled method; and 
placing the interpreter information into the data 
structure, wherein placing the interpreter infor- 
mation into the data structure includes access- 
ing the interpreter information using the access 
information. 

15. A computer-implemented method as recited in 
claim 1 4 wherein placing the interpreter information 
into the data structure includes: 

extracting the method information from the 
compiled method; 

extracting the interpreter information from the 
first frame; 

creating at least one interpreted activation us- 
ing the interpreter information and the method 
information; and 

migrating the at least one interpreted activation 
into the data structure. 

16. A computer-implemented method as recited in any 
one of claims 13-15 wherein the compiled method 
is deleted before the interpreter information is un- 
packed. 

17. A computer-implemented method as recited in any 
one of claims 13-16 further including creating a mi- 
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grating frame on the stack, wherein the migrating 
frame is arranged to associate the at least one in- 
terpreter frame with a second frame selected from 
the plurality of frames on the control stack. 

5 

18. A computer system for deoptimizing a method, the 
method being in a first state of optimization, the 
method further being associated with a frame locat- 
ed on a control stack, the computer system com- 
prising: 70 

a data structure, the data structure being sep- 
arate from the control stack, wherein the data 
structure includes information related to the 
method; ^5 
a reference indicator, the reference indicator 
being associated with the frame, wherein the 
reference indicator is arranged to reference the 
data structure; and 

a deoptimizer arranged to deoptimize the frame 20 
to a second state of optimization, wherein the 
second state of optimization is less optimized 
than the first state of optimization, the deopti- 
mizer further being arranged to deoptimize the 
method to a second state of optimization. 25 

19. A computer system as recited in claim 18 wherein 
the data structure is machine independent, and the 
deoptimizer is further arranged to deletethe method 

in the first state of optimization. 30 

20. A computer system as recited in one of claims 18 
and 19 wherein the second state of optimization is 
an interpreted state. 

35 

21. A computer-readable medium including computer 
program code devices arranged to deoptimizing a 
compiled method, the compiled method being in a 
first state of optimization, the compiled method fur- 
ther being associated with a frame located on a con- 40 
trol stack, the computer-readable medium compris- 
ing: 

computer program code devices that create a 
data structure, the data structure being sepa- 45 
rate from the control stack, wherein the data 
structure includes information related to the 
method; 

computer program code devices that create a 
reference indicator, the reference indicator be- so 
ing associated with the frame, wherein the ref- 
erence indicator is arranged to reference the 
data structure; and 

computer program code devices that deopti- 
mize the method to a second state of optimiza- 55 
tion. 
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